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DIAGNOSTIC METHQP 

5 

Background 

The present invention is concerned with using 
expression of the CD44 gene or part of the CD44 gene to 
investigate neoplasia. Such investigation includes 

10 taking a tissue, body fluid or other sample from a 

patient to perform diagnosis, to give a prognosis or 
to evaluate therapy that is already being carried out. 
In particular, the invention provides a simple method 
for carrying out routine screening for neoplasia using 

15 body fluid samples or other samples which can be 
obtained non-mvasively . 

The usual way to diagnose a tumour at present 
is by looking at cells or thin slices of tissue down a 
microscope, a method which is often very effective but 

20 has some important limitations. With a small sample, 
diagnosis can be very difficult and often a large 
number of cells will not be available, or it is not 
desirable or possible to obtain a large sample from 
the patient. In as many as 50% of cases a reliable 

25 diagnosis cannot be given; it may be that there is no 
positive evidence of carcinoma but also no certainty 
that the patient is actually free from carcinoma. 
More invasive investigation is then required to 
establish a diagnosis. 

30 Judgment of prognosis also relies on the 

appearance of cells when viewed under a microscope. 
Generally, the more bizarre-looking the cells in a 
primary tumour, the more likely they are to 
metastasise later on but the correlation is by no 

35 means absolute. It would clearly be an advantage to 
be able to predict more accurately whether or not 
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metastasis is likely to occur in order to judge what 
will be the most effective treatment. 

The human CD44 gene codes for a family of 
variably glycosylated cell surface proteins of 
5 different sizes, the numerous functions of which are 
not yet fully established, but which share epitopes 
recognised by the CD44 monoclonal antibody (mAb) . It 
is known to consist of a standard portion which is 
expressed in haemopoietic cells and many other cell 

10 types and into which the products of additional exons 
may be spliced in various combinations to produce 
different proteins. This is a well recognised 
mechanism in eukaryotes for producing several often 
functionally unrelated proteins from the same gene, and 

15 is known as alternative splicing. 

Two common CD44 isoforms have so far been 
purified and characterised (Stamenkovic et al . 1989), 
namely i) a 90kD form consisting of a central 37kD 
core which is heavily glycosylated and ii) a 180kD 

20 form which has 135 extra amino acids inserted into the 
proximal extra-membrane domain and is even more heavily 
glycosylated. Immuno-cy tochemical and immuno- 
precipitation studies have shown that both are widely 
distributed in many different cells and tissues. The 

25 former is known as the haemopoietic or standard form 

which is present on circulating leukocytes, bone marrow 
cells and numerous other cell types. The other, known 
as the epithelial variant, is detectable on several 
epithelial cell types. Both are believed to function 

30 as receptors mediating homotypic and heterotypic 

adhesive interactions, attaching cells to each other or 
to adjacent extracellular scaffolding. 

Some time ago, some of the CD44 epitopes 
recognised by the mAb Hermes-3 were identified as 

35 constituting the peripheral lymph node receptor 

enabling circulating lymphocytes to recognise and 
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traffic through peripheral lymph nodes. Further mAbs 
to this antigen later became available and Stamenkovic 
el al • (1989) used one of them to clone a cDNA 
sequence coding for the standard form of the molecule 
5 from an expression library in COS cells. They 

additionally found, by Northern blotting, that this 
gene was expressed not only by lymphoid cells, but 
also by a variety of carcinoma cell lines and a 
representative sample of solid carcinomas, amongst 
10 which two colonic carcinomas appeared to express more 
than normal colonic epithelium. 

Birch and colleagues (1991) reported that 
melanoma cell clones which strongly expressed the 80- 
90kD form of the CD44 antigen, recognised by the 
15 Hermes-3 antibody, were substantially more metastatic 
in nude mice than clones which expressed it weakly. 
Sy et al . (1991) described a moderate increase in 
metastatic capability of human lymphoma cells in nude 
mice, after the cells were transfected with the 
20 standard CD44 gene, but not after transfection with a 
construct coding for the epithelial variant. Gunthert 
et frl ■ (1991) obtained results indicating that a 
variant form of the lymphocyte homing receptor, 
recognised by a new antibody raised to the rat CD44 
25 antigen, is required for metastatic behaviour of rat 
pancreatic adenocarcinoma cells. Using this antibody 
they cloned a cDNA sequence corresponding to the 
variant form of CD44 and found that it contained 
previously unidentified exons. Transfection of a non- 
30 metastatic clone from the same cell line with a 

construct designed to over-express this cDNA sequence 
unique to the metastatic counterpart, appeared to 
induce metastatic behaviour (Gunthert et al , 1991). 

In view of these findings it became of 
35 interest to know whether other cultured metastatic and 
non-metastatic human tumour cell lines, of various 
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histogenetic origins, expressed CD44 products 
differentially. The expression of genes in cells or 
tissues can be studied most efficiently and sensitively 
by making cDNA from cellular messenger RNA and 
5 amplifying regions of interest with the polymerase 

chain reaction, using specific oligonucleotide primers 
chosen to anneal preferentially to portions of the cDNA 
corresponding to the gene products. However, 
subsequent work by Hofmann et al . (1991) and the 

10 present applicants using this approach provided results 
which showed that CD44 expression did not regularly and 
reliably correlate with the metastatic capability or 
even tumour forming ability of these cultured cell 
lines in nude mice. At about this time, three separate 

15 groups (Hofmann et al , 1991, Stamenkovic et al . 1991 
and Jackson et al . 1992) published sequence data on 
further splice variants they had found being expressed 
by this gene in various human cell lines. 

20 Tfre Invention. 

The present invention results from a 
surprising discovery resulting from studies examining 
the expression of various parts of the CD44 gene in 
fresh tissue and body fluid samples from patients with 

25 tumours of the breast and colon and from their 

metastases. The results indicate sharp and clear 
differences in CD44 expression between tissues from 
i) metastatic (malignant) tumours, ii) non- 
metastatic locally invasive tumours and benign tumours 

30 and iii) normal tissue. The distinction between 

groups i) and ii) is important for judgment of therapy 
and that between groups ii) and iii) is important for 
early diagnosis and screening. 

The invention therefore provides in one 

35 aspect a method of diagnosis of neoplasia, which method 
comprises analysing the expression of the CD44 gene in 
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a sample. 

In a particular embodiment, the invention 
provides a method of assaying a sample for products of 
the CD44 gene or part thereof which method comprises 
making cDNA from messenger RNA (mRNA) in the sample, 
amplifying portions of the complementary DNA { cDNA) 
corresponding to the CD44 gene or part thereof and 
detecting the amplified cDNA , characterised in that the 
amplified cDNA is used in diagnosis of neoplasia. 

The diagnosis of neoplasia may refer to the 
initial detection of neoplastic tissue or it may be the 
step of distinguishing between metastatic and non- 
metastatic tumours. References to the term "diagnosis" 
as used herein are to be understood accordingly. 
15 Tne method is particularly applicable to the 

diagnosis of solid tumours particularly malignant 
tumours e.g. carcinomas. The sample on which the assay 
is performed is preferably of body tissue or body 
fluid; and not of cells cultured in vitro . The sample 
20 may be a small piece of tissue or a fine needle 
aspirate (FNA) of cells from a solid tumour. 
Alternatively, it may be a sample of blood or urine or 
another body fluid, a cervical scraping or a non- 
invasive^ obtained sample such as sputum, urine or 
25 stool. 

The cDNA may be detected by use of one or 
more labelled specific oligonucleotide probes, the 
probes being chosen so as to be capable of annealing 
to part of the amplified cDNA sequence. 

30 Alternatively, labelled oligonucleotide primers and/or 
labelled mononucleotides could be used. There are a 
number of suitable detectable labels which can be 
employed, including radiolabels. 

Reference is directed to the accompanying 

35 drawings, in which:- 

Figures 1 to 5 are autoradiographs showing 
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the results of various experiments reported below, 

Figure 6 is a map of the CD44 gene showing 
exons, probes and primers. The numbering of the exons 
corresponds to that used by G. R. Screaton et al . 
5 1992), 

Figure 7 is the nucleic acid sequence of Exon 
6 (shown in Figure 6), the corresponding amino acid 
sequence being also shown, and 

Figure 8 is a set of autoradiographs showing 

10 the results of another experiment. 

Figure 6 is a map of the CD44 gene showing 
exons 6 to 14. The basic or standard protein can 
theoretically be modified by the insertion of 
transcripts from any, some, or all of these 9 extra 

15 exons. Exon 6 was unknown at the priority date of this 
patent application, and constitutes a further aspect of 
the invention. Exon 6 is over-expressed in tumours but 
not in normal tissues, and is located in the vicinity 
of exons 7 to 9 . The sequence of exon 6 is given in 

20 Figure 7. It contains 129 base pairs and is flanked on 
the 5 '-side by the standard CD44 sequence, and on the 
3 '-side usually by exon 7. 

In contrast to Exons 9 to 11, the products of 
Exon 6 (the newly-sequenced Exon) are only barely 

25 detectable in samples of normal tissues. This suggests 
that Exon 6 will be of particular value in the 
diagnosis of neoplasia. 

In another aspect, this invention provides as 
new compounds, the nucleic acid sequence of Exon 6 as 

30 shown in Figure 7, characteristic fragments thereof, 

sequences which are degenerated and/or represent allele 
variations, the homologous nucleic acid sequences, and 
probes, primers and other reagents capable of 
hybridising with the sequences or homologues. These 

35 compounds and reagents will all be useful in the method 
described above . 
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Further aspects of this invention include: 
The peptide sequence, corresponding to 
Exon 6 and shown in Figure 7, its allele variations and 
secondary modifications thereof, like phosphorylation 
5 and glycosylation products, and characteristic 

fragments thereof, for example those fragments which 
constitute epitopes when the polypeptide is folded in 
vivo - 

Antibodies to the peptide sequence, its 
10 allele variations and secondary modifications thereof, 
like phosphorylation and glycosylation products and the 
characteristic fragments thereof. Such antibodies may 
be labelled, for example with a radionuclide or with a 
tumouricidal compound. 

1 5 - Use of a labelled antibody for in vitro 

diagnosis . 

- Use of such a radiolabeled antibody for 
radioimaging or in vivo diagnosis. 

- Use of the antibodies, optionally labelled 
20 with tumouricidal compounds or otherwise, in therapy. 

The peptide sequence or fragment can be 
synthesised by standard techniques, e.g. using an 
automatic synthesiser. The antibodies can be made by 
administering the peptide in antigenic form to a 

25 suitable host. Polyclonal or monoclonal antibodies may 
be prepared by standard techniques. 

In a further aspect, the invention provides a 
method for the immunological diagnosis of neoplasia, 
characterised by determining over-expression of an exon 

30 located in the vicinity of exons 7 to 9 of the CD44 
gene. Preferably the exon has the nucleic acid 
sequence shown in Figure 7. 

The chaotic over-expression of multiple 
spliced variants of the CD44 gene in tumours, implies 

35 that a particular exon may or may not be over-expressed 
(or expressed at all) by a particular tissue sample. 
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An immunoassay using an antibody to the peptide 
expressed by any single exon may therefore give 
misleading results. This invention therefore includes 
use, for the immunological diagnosis of neoplasia, of a 
5 mixture of antibodies to two or more, and preferably to 
all nine, of the CD44 exons. 

D et ail ed Des c ri p tio n 

In one embodiment of the invention, the 

10 amplification of cDNA is carried out using the 

polymerase chain reaction (VCR) . For PCR, primers may 
be chosen using known sequence information for human 
CD44 cDNA. Primers may be used that amplify cDNA 
corresponding to any part of the CD44 gene that may be 

15 expressed. This may include the standard portion with 
or without the inserted exons, or it may be part or all 
of one or more of the exons only. In the latter case 
there would be less wastage of reagents and a better 
signal produced, and a probe for the standard sequence 

20 would not be used. 

The invention is not limited to the use of 
straightforward PCR. A system of nested primers may be 
used for example. Other suitable amplification methods 
known in the field can also be applied. 

25 In another method according to the 

invention, the amplified cDNA is separated by 
electrophoresis. Blotting and autoradiography may 
then be performed on the separated cDNA. 
Autoradiography involves probing electrophoresed 

30 amplified products, immobilised by blotting them on to 
a nylon membrane, with a radiolabelled specific 
oligonucleotide probe labelled with 32 P or other 
suitable label, the probe being chosen so as to be 
capable of annealing to part of the amplified cDNA 

35 sequence. The detection step then involves exposure 
of the labelled, separated cDNA to X-ray film. 
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In the examples which follow it was found 
that expression of the human CD44 gene was consistently 
and distinctively increased in various solid tumours 
relative to normal tissues. Malignant (i.e. already 
5 metastatic) tumours differed from locally invasive and 
benign ones in the pattern and magnitude of changes 
seen. The study was performed on samples from 46 
tumours of which 44 were locally invasive, or 
metastatic and 2 were benign. Analysis of CD44 

10 expression was performed by using PCR to amplify cDNA 
made by reverse transcription of RNA extracted from 
fresh surgical biopsy samples. By choosing 
oligonucleotide primers which specifically anneal to 
certain portions of the CD44 gene, it is possible to 

15 amplify portions of the gene which, from these results, 
are of diagnostic and prognostic interest. 

The strong association found here, between 
altered CD44 expression and neoplasia, need not imply 
that any of the individual exons of the gene are 

20 expressed only in neoplasia or in progression to 
metastatic malignancy. Evidence accrued in many 
laboratories in recent years {see Knudson 1985, Tarin 
1992, Hayle et al 1992 for reviews) indicates that 
these pathological processes are probably the 

25 consequences of disturbed regulation of genes coding 
for normal cellular activities such as cell 
proliferation and migration. Therefore it seems 
unlikely that any gene, or portion of a gene, has the 
sole function of programming neoplasia or metastasis. 

30 The finding in the present study of 

transcripts from exon 10/11 in normal tissues, 
indicates that this exon is not exclusively concerned 
with metastatic activity, even though there is marked 
increase in the number and signal intensity of bands 

35 hybridising with radiolabell ed probe E4 in the PCR 
products from tumours capable of metastasis. Other 
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supporting events are therefore believed to be required 
for CD44 exon 10/11 expression to result in metastatic 
behaviour. Nevertheless, the observation that 
transcripts from this exon were over-expressed in 
5 samples from metastatic tumours promises to be a very 
useful indicator of prognosis. 

It is not expected that further research will 
find that the natural (non-mutated) products of any 
individual exon will be uniquely present in tumour 

10 cells and not in normal counterparts. Instead, it is 
likely that an abnormal pattern of gene activity 
consisting of over-expression and inappropriate 
combination of products of a gene, such as that 
reported here for the CD44 locus, could play a part in 

15 malignancy. These changes may themselves be required 
for malignant conversion, or be the consequence of 
other genetic disturbances causing such a conversion. 
Even so, without resolving this issue, an observer 
using these techniques can obtain information relevant 

20 to assigning a sample to neoplastic or non-neoplastic 
categories . 

EXAMPLES 

Method 

25 Fresh tissue samples, 0.5 - 1 cm diameter, 

were obtained from surgical resection specimens removed 
at therapy of 34 patients with breast tumours and colon 
tumours. The samples were snap-frozen in liquid 
nitrogen within ten minutes of arrival in the 

30 pathological specimen reception area and kept in liquid 
nitrogen until use. Portions of lymph node metastases 
and blood-borne metastases were also collected, if 
present, in the tissue resected for diagnosis. Normal 
breast tissue, normal colon mucosa, normal lymph node 

35 adjacent to the tumour in the breast and normal liver 
were also collected from the surgically resected 
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samples and from other samples removed for non- 
neoplastic conditions. Normal peripheral blood 
leukocytes were obtained from 10 volunteers and bone 
marrow from 3 volunteers. The histological features of 
5 the tumours and their clinical stages were as described 
in Table 1 . 

Total cellular RNA extraction from tissue 
samples was performed according to the method described 
by Chomizynski and Sacchi (1987) . Extraction from 

10 fluid samples was by use of the Microf asttrack kit 

marketed by Invitrogen. cDNA synthesis and subsequent 
amplification by the polymerase chain reaction (PCR) 
was performed using the Superscript™ preamplif ica tion 
system (BRL Life Technologies Inc., Middlesex, UK) with 

15 buffers and reagents supplied in this kit. In brief, 
this involves an initial step of first strand cDNA 
synthesis with reverse transcriptase, using sample RNA 
as the template and supplied nucleotide triphosphates. 
For subsequent PCR each sample was overlaid with oil 

20 and heated at 94 *C for 5 minutes to denature nucleic 
acid; 30 cycles of PCR were then conducted with the 
following cycle parameters: 94'C for 1 m, 55'C for 1m, 
72 *C for 2 m. Negative controls in which there was no 
template cDNA in the reaction mix, were routinely run 

25 with each batch. The primers and probe sequences we 

devised, using information from the published sequence 
for human CD44 cDNA (Hofmann et al . 1991, Stamenkovic 
et a_l , 1991, Jackson et al , 1 992) (Figure 6) were as 

follows : 

30 

P1 = 5 'GACACATATTGCTTCAATGCTTCAGC 

P4 = 5 ' G ATGCCAAGATGATCAGCCATTCTGG AAT 

P1 is located with its origin 324bp upstream 

from the insertion site in the standard CD44 molecule 
35 (between nucleotides 782 and 783 in the sequence 

published by Stamenkovic et al . 1989) and P4 is 158bp 
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downstream of this site. These primers produce a PCR 
fragment of 482bp if a sample expresses standard CD44 
(so-called haemopoietic CD44), 878bp for the epithelial 
form of CD44 and several other bands, if a sample 
5 contains alternatively spliced transcripts. 10 ul of 
each PCR product was electrophoresed in a 1.2% agarose 
gel and transferred to Hybond N + (Amersham UK, Little 
Chalfont, UK) nylon membranes for hybridisation with 
oligonucleotide probe E4 { = 5 'TGAGATTGGGTTGAAGAAATC-3 ' ) , 

10 see Figure 6. Blotting and autoradiography were 
performed to improve sensitivity of detection and 
resolution. The probe was radiolabeled with y 32 P-ATP 
in the presence of polynucleotide kinase. After 
prehybridisation, hybridisation was performed in 10% 

15 dextran, 6 x NET, 5 x Denhardt solution, 0.5% NP40 and 
100 ug/ml salmon sperm DNA at 42 *C overnight. The 
filter was then washed twice in 2 x SSC, 1 x S5C and 
0.5% SSC with 0.1% SDS at 42*C sequentially for 15 
minutes each. Filters were exposed to Kodak X-ray film 

20 for 2-16 hours. After this, the filters were boiled in 
0.5% SDS for stripping the probe and rehybridised with 
another radiolabelled probe, namely P2 
( = 5 ' CCTGAAGAAGATTGTACATCAGTCACAGAC ) we designed to 
anneal to the standard portion of the CD44 (Figure 6). 

25 The conditions used for hybridisation, washing and 
autoradiography were the same as above. 

Calibration of the sensitivity of the 
method, for detection of small numbers of cells was 
performed as follows: total peripheral blood 

30 leukocytes (PBL) were purified from 20ml whole blood by 
lysis of packed red blood cells by addition of 
ammonium chloride buffer (1ml packed cells to 50mls 
lysis buffer) and subsequent centri f ugation 15 minutes 
later. The white cell pellet was divided into 4 

35 tubes which were seeded respectively with Oul, lul, 

lOul and 1 OOul of a suspension of HT29 colon carcinoma 
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cells (5000 cells per ml). Total RNA was then 
extracted and each tube yielded approximately 20pg. 
cDNA synthesis was performed, as described above on 
4pg aliquots of the RNA obtained from each tube 
5 representing 0, 1, 10 and 100 tumour cells per 

aliquotted sample respectively. The PCR was performed 
on these samples and on positive (tumour cells only) 
and negative (no DNA) controls using primers D1 and D5 
which were designed by us to anneal specifically to 

10 exons 7 and 14 in Figure 6. We know from previous 
studies that HT29 cells express both exons, and 
others, in a pattern easily distinguishable from PBL 
and chose the oligonucleotide primers D1 and D5 
because we wished to increase sensitivity by 

15 shortening the segment to be amplified. It was also 
reasoned that use of these primers would circumvent 
the problem of using primers P1 and P4 for this 
specific purpose because the majority of these would 
be soaked up by annealing to the standard portion of 

20 the gene. PCR cycle parameters, blotting, probing and 
washing conditions were as described above. The 
oligonucleotide sequence used for probing was 32 P 
labelled E4 . 

2 5 General Overview of Results 

As the primers (P1 and P4) amplify across the 
splice product insertion site it is clear that the 
intervening part of the standard molecule will be 
amplified, in addition to any alternatively spliced 

30 variants which contain transcripts from the additional 
exon domains. Hence the total number of products which 
could conceivably be detected with a probe (e.g. P2) to 
the standard form considering all possible combinations 
of the sequences identified from this locus, is large. 

35 Using probe E4 , 16 of these combinations, namely those 
containing E4 transcripts from exon 1 - , could 
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potentially be visualised as bands of different 
molecular sizes resolved by electrophoresis. In 
practice the full range of possible combinations was 
not detected in these results, but several (up to 9) 
5 alternative splice variants were seen in neoplastic 
tissues hybridised with each probe. Normal tissues 
from the breast, colon and lymph nodes did express some 
E4 -containing transcripts (Figures 1 and 3), in 
addition to the standard molecule (Figures 2 and 4), 
10 but peripheral blood leukocytes (Figure 5) and liver 
(Figure 4) detectably expressed only the latter with 
this combination of probes and primers. The details 
are presented below: 

15 EXAMPLE 1 

Breast Tissue Samples 

The results obtained in the study of breast 
tissue samples are illustrated in Figures 1 and 2. 
Metastatic tumour deposits and their corresponding 

20 primary tumours from all cases over-expressed several 
alternatively spliced products containing transcripts 
from exon 11 (Figure 1a). At least 8 separate bands 
were frequently seen together with a consistent doublet 
at 1500bp and 1650bp present in all tumours. Normal 

25 breast tissue and normal lymph node produced two bands 
( 1 1 50bp and 860bp) with this probe. The doublet 
mentioned above was not seen in any normal sample. 

The differences between the number, and size 
of the bands and the intensity of signal from the bound 

30 probe, between tissues in normal and malignant 

categories, was obvious in all samples examined. For 
occasional samples it was necessary to expose the 
filter to the X-ray film for longer, to see the 
distinctive differences, but this finding was confirmed 

35 in every case studied. 

Samples from locally invasive tumours with no 
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associated clinical evidence of metastasis and from the 
two fibroadenomas also over-expressed splice products 
containing transcripts from exon 10/11 relative to 
normal tissues, but the extent of this was easily 
5 distinguished from the results obtained with malignant 
tumours and their metastases. Distinction from the 
patterns seen in normal tissues was also easy (Figure 
lb) . However, a single sample gave a similar result to 
malignant tumours {lane 14) (see below). The two 

10 fibroadenomas showed band patterns that were similar to 
those from non-metastatic carcinomas and the sample 
from a case of cystic disease of the breast resembled 
the pattern for normal non-neoplas tic breast tissue. 
This is the first instance of definitive diagnosis by 

15 this method. The piece of tissue was provided by the 
duty pathologist as being from a benign tumour, namely 
a fibroadenoma, on macroscopic appearance at initial 
inspection with the naked eye. It was then 
characterised as definitely non-neoplastic after PCR 

20 amplification of its cDNA, and subsequent microscopical 
examination of the tissue confirmed this. 

To confirm that the differences seen with 
probe E4 are valid and not technical artifacts, the 
results obtained when the same filter was hybridised 

25 with probe P2 are shown in Figure 2. This shows that 

i) all tissues examined expressed the standard form of 
the gene, ii) other exon splice products, not 
containing transcripts from exon 10/11, were present in 
tumours and metastases and iii) that the differences 

30 described above are not due to unequal loading of 
tracks in the various panels and lanes on this 
composite filter, but resulted from alternative 
splicing. All conditions in this experiment were the 
same as those in hybridisation with E4, except the 

35 exposure time of the filter to X-ray film (10 hours 

exposure for Figure 1, versus 1.5 hours for Figure 3). 
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Colon Samp les 

The findings in colon carcinoma were 
identical to those in breast carcinoma. Thus, in all 
5 cases the colon carcinoma tissues showed increased 
number of more intensely labelled, larger molecular 
weight bands with probe E4 (Figure 3) than normal 
colonic mucosa and other normal tissues. As with 
breast carcinomas, hybridisation with probe P2 showed 
10 no differences in the degree of expression of the 
standard form of the molecule (Figure 4). 

EXAMPLE 3 

Calibration of the Sensitivity of the Metho d 
15 Examination of autoradiograms of PCR 

products of peripheral blood leukocytes seeded with 
known numbers of HT29 colon carcinoma cells showed the 
presence of additional bands characteristic of tumour 
cells, down to a level of 10 tumour cells in a sample 
20 of 10 7 leukocytes. By fine-tuning the conditions of 

the assay it is considered possible to detect a single 
tumour cell in 10ml of blood. 

In the series described above, all samples of 
25 neoplastic tissue showed over-expression of 

alternatively spliced products of the CD44 gene and 
none of the samples from non-neoplas tic tissue did so. 
Therefore, there was complete correspondence between 
normal or neoplastic origin of a sample and pattern of 
30 CD44 expression. In one instance, a tumour removed 

from a patient {patient B16, lane 14 in Figure 1A) with 
no current clinical evidence of metastasis, was found 
to have a pattern of expression indicating metastatic 
capability. At present it is not possible to know 
35 whether this is a false positive result, or a sign of 
imminent metastasis. This patient is currently under 
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observation in the follow-up clinic. 

EXAMPLE 4 

We have designed and synthesised 
5 oligonucleotide primers according to our current 
findings, as follows: - 

Primer P1 = 5 ' -GACACATATTGCTTCAATGCTTCAGC (458-484) 
Primer P2 = 5 ' - CCTGAAGAAG ATTGTACATCAGTCACAGAC (488-518) 
Primer P3 = 5 ' -TGGATCACCGACAGCACAGAC (746-767) 

10 Primer P4 = 5 ' -GATGCCAAGATGATCAGCCATTCTGGAAT (912-941) 
for standard part (Stamenkovic 1989) 
Primer E1 =5 ' -TTGATGAGCACTAGTGCTACAGCA 
Primer E2 = 5 ' -CATTTGTGTTGTTGTGTGAAGATG 
Primer E3-5 ' - AGCCCAGAGGACAGTTCCTGG (534-554) 

15 Primer E4 = 5 ' -TGAGATTGGGTTGAAGAAATC (558-578) 

Primer E5=5 ' -TCCTGCTTGATGACCTCGTCCCAT (585-608) 

D1 : 5' GAC AGA CAC CTC AGT TTT TCT GGA (63-B6) 
D5 : 5 ' TTC CTT CGT GTG TGG GTA ATG AGA (888-911) 
for the exons (Hofmann 1991). E1 and E2 are on exon 6. 

20 Fresh tissue samples 0.5-1 cm in diameter 

were obtained from surgical resection specimens or at 
autopsy. All samples used in this work were 
obtained from the residue of tissue remaining after 
diagnostic samples had been taken, and which would 

25 otherwise have been discarded. The samples were snap- 
frozen in liquid nitrogen within ten minutes of arrival 
at the pathological specimen reception area and kept 
frozen in nitrogen until use. cDNA was synthesised 
with viral reverse transcriptase using 5 ug of total 

30 cellular RNA as template, followed by PCR with 
Primer PI and Primer P4 . PCR amplification, 
electrophoresis and hybridisation were performed under 
standard conditions . 

When the PCR products were hybridised with 

35 radiolabeled E2 or E4, all samples from carcinomas 

over-expressed several splice variants, but the pattern 
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of bands seen with each probe was different. Hence, 
the oligonucleotide probe for Exon 6 products is very 
effective in distinguishing neoplastic from non- 
neoplastic samples, but not significantly more 
5 sensitive than E4 , at least on samples from solid 

tissues, but is possibly useful for detecting organ of 
origin of a disseminating metastatic cell or an 
established metastasis. Subsequently, the same filters 
were stripped and hybridised with P2 probe to show that 

10 all samples, including normal tissues, produced the 
standard portion of CD44 . This confirmed that the 
differences observed between the results obtained with 
normal and tumour samples, probed with E2 and E4 , were 
not due to unequal loading of PCR products. The 

15 cumulative results are summarised in Table 3 which 

indicates that these changes are seen in a wide range 
of common cancers . 

Table 3 

20 



30 



Type of Tissue 


No. of Patients/ No. 


Showing Increased 




Volunteers 


Splice Variants 


Neoplastic 


47 


46 


Breast Cancer 


21 


21 


Colon Cancer 


13 


13 


Bladder Cancer 


b 


6 


Stomach Cancer 


1 


i 


Thyroid Cancer 


1 




Fibroadenoma 


2 


2 


Prostate Cancer 


3 


n 


Non-Neoplastic 


39 


0 


Normal Breast 


9 


Ci 


Cystic Disease of 


Breast 1 


0 


Normal Colon 


q 


0 


Crohn's Disease' 


l 


0 


Ulcerative Colitis 


1 


0 


Append i c 1 1 i s 


! 


0 


Normal Bladder 


L 


0 


PBI. 


10 


0 


Bone Marrow 


3 


0 
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We have also examined some malignant tumours 
of bone muscle and observed a similar pattern, of 
marked over-expression of multiple spliced variants, in 
the osteosarcoma. 

5 

EXAMPLE 5 

Cancer Diagnosis fry PCR Assay of Clinically-Harvested 
Urjpe Samples, 

Approximately 50 ml naturally-voided urine 

10 were obtained from each person and transported to the 
laboratory as speedily as possible. Specimens from 90 
patients were examined: 44 from patients with biopsy- 
proven bladder cancer, 46 from patients with non- 
neoplastic inflammation of the bladder (cystitis) and 

15 from normal volunteers. One ml of each urine sample 
was removed after thorough mixing and submitted for 
cytological examination. Another 1 ml of urine was 
checked by Fluorescein diacetate-ethidium bromide 
staining to assess the viability of cells in the 

20 sample. The remainder of the urine was centrifuged at 
2000rpm for 10 minutes and the cell pellet was kept at 
-70 "C until use. mRNA extraction was performed with 
oligo dT cellulose tablets (invitrogen) . cDNA was 
synthesised with AMV reverse transcriptase 

25 (Invitrogen) . The completed cDNA solution was divided 
equally into two tubes, one being for PCR with E1 and 
E5, to amplify the particular cDNA transcript, which we 
have found to be of diagnostic value and the other for 
PCR with PI and P4 to amplify the standard form of 

30 CD44, with or without all splice variants, as an 
internal control. 

Thirty-five cycles PCR were then carried out. 
The cycle conditions were: 95'C 1 minutes, 55*C 1 
minute, 72 'C 2 minutes. A 'hot start' procedure was 

35 adopted for all samples. Results are shown in 
Fi gure 8 . 
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Equal volumes of PCR products were loaded in 
each lane of a 1.2% agarose gel and stained with 
ethidium bromide. If the cells in the urine were to be 
expressing all the Exons from Exon 6 to Exon 14, it was 
5 predicted that with the current PCR protocol, using 

primers E1 and E4 , should produce a 735 bp band. There 
is no band in tracks containing cDNA from normal urine 
or that of patients with non-neoplastic cystitis (lanes 
1-8) but a clear 735 band is seen in all urine samples 

10 from patients with bladder cancer (lanes 9-16) when PCR 
was performed with primer E1 and E5 (upper panel) . 

A 482 bp band representing the standard form 
of CD44 was obtained almost equally in all cases when 
PCR was performed with P1 and P4 (lower panels) . This 

15 indicates that the diagnos tically significant 

differences between urine from patients with bladder 
cancer and that from controls were not caused by 
unequal loading of the tracks but by alternative 
splicing of the CD44 gene. Lanes 1-4: normal urine. 

20 Lanes 5-8: cystitis urine. Lanes 9-16: from patients 
1-8 with bladder cancer. 

In the overall results this 735 bp band was 
completely absent in 7 of 7 normal and 9 of 9 cystitis- 
affected urine specimens; that is 0% false positive. 

25 Also 14 of 19 (74%) urine samples from patients with 

bladder cancer showed a positive result (i.e. 26% false 
negatives) . In the false negative samples there was a 
shortage of viable cancer cells as indicated by 
f luorescem-d acetate ethidium bromide staining. 

30 

EXAMPLE 6 

Stools from 12 patients were assayed by the 
techniques described herein. Of the samples from 9 
patients with colorectal carcinoma, 5 gave positive 
35 results. Of the samples from 3 normal patients, all 3 

gave negative results. These figures, obtained frorr. 



WO 94/02633 



PCT/GB93/01520 



samples full of bacteria which were not subjected to 
any pre treatment , encourage the belief that a viable 
diagnostic assay could be developed without difficulty. 
In the inventors ' further experience of 
5 detecting tumour cells with this method, the following 
observations would be useful to others investigating 
its diagnostic potential. The major considerations to 
be aware of are that the reliability and 
reproducibility of the results depend critically on the 

10 quality of the mRNA obtained from the sample and upon 
the care with which the techniques are performed. The 
main requirement is to eliminate false negative results 
by ensuring that high quality mRNA is routinely 
obtained and by using internal standards in every 

15 reaction to monitor the PCR amplification step. False 
positives, providing they are not too frequent, are not 
a serious problem, because they can be recognised by 
replicate assays on the same or further samples and by 
reference to other clinical data. 

20 The inventors have explored the procedures 

needed to ensure the routine RT-PCR detection of 
abnormal CD44 gene activity in small clinical samples 
containing tumour cells. If a tissue sample is divided 
into aliquots, half of which are frozen in liquid 

25 nitrogen immediately and the remainder of which are 
left at ambient temperature, one can show how the 
ability to detect CD44 splice variants declines with 
time and with mode of specimen handling. Fresh samples 
submitted to mRNA extraction within half an hour of 

30 excision give the most reliable results and there is a 
gradual decline in quality over the next few hours if 
the fresh tissue is left at ambient temperature. If 
the sample is first snap frozen, the results obtained 
when RNA is extracted immediately after thawing are 

35 satisfactory, but decline very rapidly, beginning 

within 15 minutes, the larger variant transcripts being 
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lost first and ultimately even the standard form. It 
is also found that if snap-frozen cell and tissue 
samples are stored at -70 "C the results decline after 4 
weeks, even if the mRNA is extracted immediately after 
5 thawing. It would seem therefore that degradation of 
RNA by ribonucleases released from cells ruptured 
during freezing continues, even at this temperature, 
although at slower rates. Further, as one would 
expect, if the sample taken for RNA extraction is from 
10 an area of necrosis or of fibrosis, one does not obtain 
the typical results seen with viable tumour tissue. 
Hence, care in sample selection and in specimen 
processing are both needed for generating reliable 
data . 

15 Arising out of this, we prefer that a fresh 

sample should be held for not more than 24 hrs before 
being either frozen or treated to extract mRNA; and 
that a thawed sample should be held for not more than 
2 hrs before being treated to extract mRNA. 

20 The diagnosis method described herein can be 

performed in a single day, possibly in a few hours, and 
is capable of being automated. Use of the method has 
been demonstrated, on various tissue samples to detect 
a whole variety of cancers, and also on blood and urine 

25 samples. We therefore offer it as a convenient 

practical method for cancer screening and diagnosis. 
In principle it could also have wide general 
applicability to cancer detection and prevention 
programmes and therefore have epidemiologic and public 

30 health value. Proper application of its sensitivity, 
specificity and simplicity should add not only to 
initial cancer diagnosis but to evaluation of extent of 
disease in the body, to judgment of the efficacy of 
treatment and to early detection of tumour recurrences. 



35 
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FIGURE LEGENDS : 

Notation: N = normal, T = primary tumour, 
M - metastasis. 

5 Figure 1 

Autoradiograra of PCR products from breast 
tissue samples probed with E4 (10 hours exposure of X- 
ray film to sample filter) . Panel A: malignant 
primary breast carcinomas with their metastases. 

10 Tracks 1, 2 and 3: patient B1 ; tracks 4, 5 and 6: 

patient B2 ; tracks 7, 8 and 9: patient B3; tracks 10 
and 11: patient B4 ; tracks 12 and 13: patient B5 . 
It can be seen that compared to normal breast tissue, 
primary breast carcinomas and their metastatic deposits 

15 overexpress several splice-variants. Note the doublet 
(arrows) at 1500bp and 1650bp best seen in track 5. 
This is present in all tumours and metastases but is 
fogged in the other tracks by this time of exposure. 
It is not detectable in any normal samples even at much 

20 longer exposure times (23 hours). Panel B: Breast 
carcinomas with no clinical evidence of metastasis. 
Tracks 14-20 are from patients B15-B21. The tumours 
all overexpress several variants, but show less bands 
and the signal intensity is less, except track 16 

25 (patient B17) - see text. The 1500/1650bp doublet 

(arrow) is easily recognisable in tracks 15, 16 and 18 
at this length of exposure and became detectable in all 
other tumour-containing tracks on longer exposure. The 
illustration, however, shows only the shorter exposure, 

30 to avoid fogging the tracks which have stronger 

signals. Panel C: Fibroadenomas (FA) and fibrocystic 
disease of the breast (Cyst) . Tracks 21 and 22, 
containing the benign tumour samples (samples B22 and 
23), express more than the non-neoplastic sample 

35 (fibrocystic disease) in track 23 (sample B24) . 
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Figure 2 

Autoradiogram of PCR products from breast 
tissue samples probed with probe P2 {1.5 hours exposure 
of X-ray film to sample filter) . This result was 
5 obtained by reprobing the same filter as that used in 
Figure 1, after stripping off the previous probe. Here 
it can be seen that i) the differences observed in 
Figure 1 are not due to unequal loading of tracks, 
ii) that the expression of the standard form of the 

10 molecule is quantitatively greater than any of the 

variants, iii) the standard form is expressed in all 
tissues examined and iv) further variants which do not 
contain exon 3 transcripts, are also present and 
over-expressed in tumours. The 1 500/1 650bp doublet can 

15 be recognised in the tumours in panel A but needed 
longer exposure to be detectable in panels B and C. 

Figure 3 

Autoradiogram of PCR products from colon 
20 tissue samples probed with E4 (10 hours exposure of 

photographic film to sample filter). Tracks 1, 2 and 
3: patient CI; tracks 4, 5 and 6: patient C2; 
tracks 7, 8 and 9: patient C3; tracks 10 and 11: 
patient C4; tracks 12 and 13: patient C5 ; track 14: 
25 normal liver sample. The picture shows the same 

features as described in the legend to Figure 1 and 
that the findings apply to carcinomas of the colon. 
The 1500/1650bp doublet (arrow) is easily recognisable 
in several tumour tracks (2 and 8-12) and the faint 
30 signal in the corresponding position in tracks 3, 5, 6 
and 13 became stronger on longer exposure. However 
none appeared in this vicinity in tracks 1, 4, 7 or 14 
(normal tissue) . 



35 
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Figure 4, 

Autoradiogram of PCR products from colon 
tissue samples probed with P2 (1.5 hours exposure of 
photographic film to sample filter) . This confirms 
5 equal loading of the tracks and that other points, 
illustrated in Figure 2, apply to colon carcinomas. 
Note that normal liver expresses the standard form of 
CD44 . 



10 Figure 5 

Autoradiogram of PCR products of normal 
peripheral blood leukocytes, PBL (from 3 different 
persons) and other normal tissues probed with E4 (panel 
A; 8 hours exposure to photographic film) and P2 (panel 

15 B; 5 hours exposure to photographic film) . Track 6 

contains PCR products from a breast cancer (patient B1) 
as a positive control. With this combination of 
primers and probes, leukocytes can be seen to express 
the standard form of the CD44 molecule, but no 

20 detectable splice variants. The samples in tracks 4 
and 5 were from individuals with no clinical evidence 
of neoplasia, as follows: track 4, breast tissue 
obtained at autopsy from the body of a woman who died 
of bacterial endocarditis; track 5, colon resected for 

25 volvulus. 
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CLAIMS 

5 

"1 ■ A method of diagnosis of neoplasia, which 

method comprises analysing expression of the CD44 gene 
in a sample. 

2. A method as claimed in claim 1 wherein the 

10 sample is assayed for products of the CD44 gene or part 
thereof by a method comprising making complementary DNA 
(cDNA) from messenger RNA (mRNA) in the sample, 
amplifying portions of the cDNA corresponding to the 
CD44 gene or part thereof and detecting the amplified 

15 cDNA. 

3. A method as claimed in claim 1 or claim 2, 
wherein the sample is from a tissue which may be a 
solid tumour or from the blood or other body fluid. 
4- A method as claimed in claim 1 or claim 2, 

20 wherein the sample is non-invasively obtained. 

5. A method as claimed in any one of claims 1 
to 4, wherein a labelled specific oligonucleotide 
primer or probe is used in detection of the cDNA. 

6- A method as claimed in any one of claims 2 to 

25 5 , wherein the amplification is carried out using the 
polymerase chain reaction. 

7. A method as claimed in any one of claims 2 to 

6, wherein the amplified cDNA is separated by 
electrophoresis prior to detection. 

30 8 . A method as claimed in claim 7, wherein 

blotting and autoradiography are performed on the 
separated cDNA. 

9. The exon of CD44 which is over-expressed in 

tumours but not in normal tissues and which is located 
35 in the vicinity of exons 7 to 9 . 
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1 0. The exon of CD44 having the nucleic acid 
sequence shown in Figure 7, characteristic fragments 
thereof, sequences which are degenerated and/or 
represent allele variations, the homologous nucleic 

5 acid sequences, and probes, primers and other reagents 
capable of hybridising with the sequences or 
homologues . 

11. Use of the compounds and reagents claimed in 
claim 9 or claim 10 in the method of any one of claims 

10 1 to 8. 

12. The peptide corresponding to the CD44 exon of 
claim 10 and whose sequence is shown in Figure 7, its 
allele variations and phosphorylation and glycosylation 
products, and characteristic fragments thereof. 

15 13. Antibodies to the peptide, its allele 

variations, glycosylation products and characteristic 

fragments of claim 12, and fragments thereof. 

14 ■ Antibodies as claimed in claim 13 which are 

labelled. 

20 15. Use of radiolabeled antibodies according to 

claim 14 for radioimaging or in vivo diagnosis. 
16 • Us © of the antibodies according to claim 14, 

labelled with tumouricidal compounds, in therapy. 

17 - A method for the immunological diagnosis of 
25 neoplasia, characterised by determining over-expression 

of an exon located in the vicinity of exons 7 to 9 of 
the CD44 gene. 

18 - A method as claimed in claim 17, wherein the 
exon has the nucleic acid sequence shown in Figure 7. 

30 19. A method as claimed in claim 17 or claim 18, 

wherein there is used a mixture of antibodies including 
an antibody to exon 6 . 

20 ■ A method as claimed in claim 19, wherein the 

antibodies are selected from antibodies to exons 6 to 
35 14. 
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Fig. 8 
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STANDARD 


New Exon 







GCTACCACTTTGATGAGCACTAGTGCTACA 
T L M S T S A T 

GCAACTGAGACAGCAACCAAGAGGCAAGAA 
ATETATKRQE 

ACCTGGGATTGGTTTTCATGGTTGTTTCTA 
TWDWFSWLFL 

CCATCAGAGTCAAAGAATCATCTTCACACA 
PSESKNHLH1 

ACAACACAA ATGGCTGGTACG 
T T Q M A 
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